ANTIMICROBIAL PROTEINS FROM ARALIA AND IMPATIENS.

This invention relates to antimicrobial
proteins, processes for their manufacture and use,
and DNA sequences encoding them. In particular it
relates to antimicrobial proteins capable of being
isolated from seeds of Aralia or Impatiens.

In this context, antimicrobial proteins are 
defined as proteins or peptides possessing at least
one of the following activities: antifungal
activity (which may include anti-yeast activity);
antibacterial activity. Activity includes a range
of antagonistic effects such as partial inhibition 
or death. Antimicrobial proteins may be oligomeric 
or may be single peptide units. 

The genus Aralia is part of the Araliaceae, a
medium-sized plant family whose best known members
are ivy and ginseng. Medicinal extracts have been
obtained from some Aralia species, such as Aralia
cordata.

The genus Impatiens is part of the
Balsaminaceae plant family. There are five
to six hundred Impatiens species, many of which are
commercially cultivated as greenhouse or pot 
plants. 

Plants produce a wide array of antifungal 
compounds to combat potential invaders and over the 
last ten years it has become clear that proteins 
with antifungal activity form an important part of
these defences. Several classes of such proteins
have been described including thionins,
beta-1,3-glucanases, ribosome-inactivating proteins,
zeamatins, chitin-binding lectins and chitinases.
These proteins have gained considerable attention
as they could potentially be used as biocontrol
agents.

Antimicrobial proteins with activity against
plant pathogenic fungi have been isolated from
certain plant species. We have previously
described the structural and antifungal properties
of several such proteins, including:

Mj-AMP1 and Mj-AMP2 from Mirabilis jalapa
seeds (Cammue BPA et al, 1992, J Biol Chem,
267:2228-2233; International Application
Publication Number WO92/15691);

Ac-AMP1 and Ac-AMP2 from Amaranthus caudatus
seeds (Broekaert WF et al, 1992, Biochemistry,
31:4308-4314; International Application Publication
Number WO92/21699);

Ca-AMP1 from Capsicum annuum, Bm-AMP1 from
Briza maxima, Da-AFP from Delphinium, Cr-AFP from
Catapodium, Ba-AFP from Baptisia and Ml-AFP from
Microsensis (International Patent Application
Publication Number WO94/11511);

Rs-AFP1 and Rs-AFP2 from Raphanus sativus
seeds (Terras FRG et al, 1992, J Biol Chem,
267:15301-15309) and related proteins such as
Bn-AFP1 and Bn-AFP2 from Brassica napus, Br-AFP1
and Br-AFP2 from Brassica rapa, Sa-AFP1 and Sa-AFP2
from Sinapis alba, At-AFP1 from Arabidopsis
thaliana, Dm-AMP1 and Dm-AMP2 from Dahlia merckii,
Cb-AMP1 and Cb-AMP2 from Cnicus benedictus, Lc-AMP
from Lathyrus cicera, Ct-AMP1 and Ct-AMP2 from
Clitoria ternatea, Rs-nsLTP from Raphanus sativus
(International Patent Application Publication
Number WO93/05153).

These and other plant-derived antimicrobial
proteins are useful as fungicides or antibiotics,
particularly for agricultural purposes. The
proteins may be applied to or around a plant or may
be expressed within a plant.

We have now purified new potent antimicrobial
proteins.

According to the present invention, there is 
provided an antimicrobial protein of about 3 kDa, 
capable of being isolated from seeds of Aralia or
Impatiens.

We have purified two new antimicrobial
proteins from seeds of Aralia chinensis, hereafter
called Arc-AMP1 (Aralia chinensis - antimicrobial
protein 1) and Arc-AMP2 (Aralia chinensis -
antimicrobial protein 2).

We have also purified four new antimicrobial
proteins from seeds of Impatiens balsamina,
hereafter called Ib-AMP1 (Impatiens balsamina -
antimicrobial protein 1), Ib-AMP2 (Impatiens
balsamina - antimicrobial protein 2), Ib-AMP3
(Impatiens balsamina - antimicrobial protein 3) and
Ib-AMP4 (Impatiens balsamina - antimicrobial
protein 4).

An antimicrobial protein according to the 
invention is capable of being isolated from seeds 
of Aralia or Impatiens , and may also be capable of 
isolation from the seeds of both related and 
unrelated species, or may be produced or
synthesised by any suitable method.

According to the present invention, there is
further provided an antimicrobial protein having an
amino acid sequence which is substantially
homologous to a sequence selected from the group
consisting of SEQ ID NO 10, SEQ ID NO 11, SEQ ID NO
12 and SEQ ID NO 13. A sequence is "substantially
homologous" if it has at least 60% sequence
identity with any one of the SEQ ID NOs 10 to 13
and it encodes a protein having antimicrobial
activity.
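
The 60% identity criterion above can be illustrated with a minimal sketch. It assumes the two sequences have already been aligned to equal length (with "-" marking gaps) and counts identity over all aligned columns; the text does not specify an alignment method or denominator, so both are assumptions, and the sequences in the usage note are made-up placeholders, not SEQ ID NOs 10 to 13. The second requirement (antimicrobial activity) can only be established experimentally.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity over two pre-aligned, equal-length sequences ('-' = gap)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    # Ignore columns in which both sequences carry a gap.
    columns = [(a, b) for a, b in zip(seq_a, seq_b) if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("no aligned positions")
    # A column counts as identical only if both residues match and neither is a gap.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def is_substantially_homologous(candidate: str, reference: str,
                                threshold: float = 60.0) -> bool:
    """True if the candidate meets the identity threshold against the reference."""
    return percent_identity(candidate, reference) >= threshold
```

For the placeholder pair "ACDEFGHIKL" and "ACDEFGHAAA", seven of ten aligned positions match, so the identity criterion is met.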

The antimicrobial protein may be extracted and 
purified from plant material, manufactured from its 
known amino acid sequence by chemical synthesis 
using a standard peptide synthesiser, or produced 
within a suitable organism (for example, a 
micro-organism or plant) by expression of 
recombinant DNA. The antimicrobial protein is 
useful as a fungicide or an antibiotic and may be 
used for agricultural or pharmaceutical
applications.

Amino acid sequencing of the Ib-AMPs is
described in Examples 7 and 8. The amino acid
sequence of Ib-AMP1 is shown as SEQ ID NO 10; the
amino acid sequence of Ib-AMP2 is shown as SEQ ID
NO 11; the amino acid sequence of Ib-AMP3 is shown
as SEQ ID NO 12; the amino acid sequence of
Ib-AMP4 is shown as SEQ ID NO 13. The four Ib-AMPs
are very close homologues of each other, but have no
significant homology to the sequences of known
proteins.

Knowledge of its primary structure enables 
manufacture of the antimicrobial protein, or parts 
thereof, by chemical synthesis using a standard 
peptide synthesiser. It also enables production of
DNA constructs encoding the antimicrobial protein. 

The invention further provides a DNA sequence 
encoding an antimicrobial protein according to the 
invention. The DNA sequence may be a cDNA sequence 
or a genomic sequence, and may be derived from a 
cDNA clone, a genomic DNA clone or DNA manufactured 
using a standard nucleic acid synthesiser. 

The DNA sequence may be predicted from the 
known amino acid sequence and DNA encoding the 
protein may be manufactured using a standard 
nucleic acid synthesiser. Alternatively, the DNA 
sequence may be isolated from plant-derived DNA 
libraries. Suitable oligonucleotide probes may be 
derived from the known amino acid sequence and used 
to screen a cDNA library for cDNA clones encoding 
some or all of the protein. Cloning of the Ib-AMP
cDNA is described in Example 8. Oligonucleotide
probes or cDNA clones may be used to isolate the
actual antimicrobial protein gene(s) by screening
genomic DNA libraries. Such genomic clones may
include control sequences operating in the plant
genome. Thus it is also possible to isolate 
promoter sequences which may be used to drive 
expression of the antimicrobial (or other) 
proteins. These promoters may be particularly 
responsive to environmental conditions (such as the 
presence of a fungal pathogen) , and may be used to 
drive expression of any target gene. 

The DNA sequence encoding the antimicrobial 
protein may be incorporated into a DNA construct or 
vector in combination with suitable regulatory 
sequences (promoter, terminator, etc). The DNA
sequence may be placed under the control of a
constitutive or an inducible promoter (stimulated
by, for example, environmental conditions, presence
of a pathogen, presence of a chemical). Such a DNA
construct may be cloned or transformed into a
biological system which allows expression of the
encoded protein or an active part of the protein.
Suitable biological systems include micro-organisms
(for example, bacteria such as Escherichia coli,
Pseudomonas and endophytes such as Clavibacter xyli
subsp. cynodontis (Cxc); yeast; viruses;
bacteriophages; etc), cultured cells (such as
insect cells, mammalian cells) and plants. In some
cases, the expressed protein may subsequently be 
extracted and isolated for use. 

An antimicrobial protein according to the 
invention is useful as a fungicide or an 
antibiotic. The invention further provides a 
process of combating fungi or bacteria whereby they 
are exposed to an antimicrobial protein according 
to the invention. 

For pharmaceutical applications, the
antimicrobial protein may be used as a fungicide or
anti-bacterial to treat mammalian infections (for
example, to combat yeasts such as Candida).

An antimicrobial protein according to the
invention may also be used as a preservative (for
example, as a food additive).

For agricultural applications, the 
antimicrobial protein may be used to improve the 
disease-resistance or disease-tolerance of crops
either during the life of the plant or for
post-harvest crop protection. Pathogens exposed to
the proteins are inhibited. The antimicrobial 
protein may eradicate a pathogen already 
established on the plant or may protect the plant 
from future pathogen attack. The eradicant effect 
of the protein is particularly advantageous. 

Exposure of a plant pathogen to an 
antimicrobial protein may be achieved in various 
ways, for example: 

(a) a composition comprising the isolated 
protein may be applied to plant parts or the 
surrounding soil using standard agricultural 
techniques (such as spraying); the protein may have
been extracted from plant tissue or chemically 
synthesised or extracted from micro-organisms 
genetically modified to express the protein; 

(b) a composition comprising a micro-organism 
genetically modified to express the antimicrobial 
protein may be applied to a plant or the soil in 
which a plant grows; 

(c) an endophyte genetically modified to
express the antimicrobial protein may be introduced
into the plant tissue (for example, via a seed
treatment process);

[An endophyte is defined as a micro-organism having
the ability to enter into non-pathogenic
endosymbiotic relationships with a plant host. A
method of endophyte-enhanced protection of plants
has been described in a series of patent
applications by Crop Genetics International
Corporation (for example, International Application
Publication Number WO90/13224, European Patent
Publication Number EP-125468-B1, International
Application Publication Number WO91/10363,
International Application Publication Number
WO87/03303). The endophyte may be genetically
modified to produce agricultural chemicals.
International Patent Application Publication Number
WO94/16076 (ZENECA Limited) describes the use of
endophytes which have been genetically modified to
express a plant-derived antimicrobial protein].

(d) DNA encoding an antimicrobial protein may
be introduced into the plant genome so that the
protein is expressed within the plant body (the DNA
may be cDNA, genomic DNA or DNA manufactured using
a standard nucleic acid synthesiser).

Plant cells may be transformed with
recombinant DNA constructs according to a variety
of known methods (Agrobacterium Ti plasmids,
electroporation, microinjection, microprojectile
gun, etc). The transformed cells may then in
suitable cases be regenerated into whole plants in
which the new nuclear material is stably
incorporated into the genome. Both transformed
monocotyledonous and dicotyledonous plants may be
obtained in this way, although the latter are
usually easier to regenerate. Some of the
progeny of these primary transformants will inherit
the recombinant DNA encoding the antimicrobial
protein(s).

The invention further provides a plant having 
improved resistance to a fungal or bacterial 
pathogen and containing recombinant DNA which 
expresses an antimicrobial protein according to the 
invention. Such a plant may be used as a parent in 
standard plant breeding crosses to develop hybrids 
and lines having improved fungal or bacterial 
resistance.

Recombinant DNA is heterologous DNA which has
been introduced into the plant or its ancestors by
transformation. The recombinant DNA encodes an
antimicrobial protein expressed for delivery to a
site of pathogen attack (such as the leaves). The
DNA may encode an active subunit of an 
antimicrobial protein. 

A pathogen may be any fungus or bacterium 
growing on, in or near the plant. In this context
improved resistance is defined as enhanced
tolerance to a fungal or bacterial pathogen when
compared to a wild-type plant. Resistance may vary
from a slight increase in tolerance to the effects
of the pathogen (where the pathogen is partially
inhibited) to total resistance so that the plant is
unaffected by the presence of pathogen (where the
pathogen is severely inhibited or killed). An
increased level of resistance against a particular 
pathogen or resistance against a wider spectrum of 
pathogens may both constitute an improvement in 
resistance. Transgenic plants (or plants derived 
therefrom) showing improved resistance are selected 
following plant transformation or subsequent 
crossing. 

Where the antimicrobial protein is expressed 
within a transgenic plant or its progeny, the 
fungus or bacterium is exposed to the protein at 
the site of pathogen attack on the plant. In 
particular, by use of appropriate gene regulatory 
sequences, the protein may be produced in vivo when 
and where it will be most effective. For example,
the protein may be produced within parts of the
plant where it is not normally expressed in
quantity but where disease resistance is important
(such as in the leaves).

WO 95/24486 A1
PCT/GB95/00509

Examples of genetically modified plants which 
may be produced include field crops, cereals, fruit 
and vegetables such as: canola, sunflower, tobacco, 
sugarbeet, cotton, soya, maize, wheat, barley, 
rice, sorghum, tomatoes, mangoes, peaches, apples, 
pears, strawberries, bananas, melons, potatoes, 
carrot, lettuce, cabbage, onion. 

As the antimicrobial proteins of the 
invention are very active against some of the major 
maize pathogens, it would be particularly 
advantageous to transform maize plants with 
constructs encoding said proteins. Alternatively, 
the proteins may be supplied to maize plants by any 
other suitable method. 

A further aspect of this invention relates 
generally to the expression of "polyproteins" in 
transgenic plants. A "polyprotein" is defined as 
two or more peptides linked together to form a 
single translation product. The component peptides 
are separated by cleavage sites whereby the 
expressed polyprotein is post-translationally 
processed into the component molecules. Such 
cleavage is achieved by the action of proteases or 
by self-processing of the polyprotein.

The relative levels of expression of several
introduced genes in transgenic plants are
notoriously influenced by "position effects"
determined by the particular site of transgene
integration into the genome. Even when introduced
genes are linked on the same T-DNA, driven either
by convergent or divergent promoters, they are
usually not co-ordinately expressed at similar
levels. This poses particular problems when high
level expression of a number of introduced
activities is required, for instance when
attempting to express novel biochemical pathways in
plants. In an attempt to achieve tissue specific,
co-ordinated expression of two proteins,
researchers have linked genes by co-transference on
the same T-DNA but expression levels were found to
vary independently. Another strategy was to link
genes via adjacent and divergent promoters, but
consistently co-ordinated expression was not
obtained.

Linking proteins in the form of polyproteins
is a strategy adopted in the replication of many
viruses. On translation, virus-encoded proteinases
mediate extremely rapid intramolecular (cis)
cleavages of the polyprotein to yield discrete
protein products. International Patent Application
Number PCT/GB94/02765 (filed on 19 December 1994)
describes a method for the expression of multiple
proteins in a transgenic plant comprising inserting
into the genome of the plant a gene construct
comprising a 5'-region which includes a promoter
which is capable of initiating transcription of a
structural gene under the control thereof, a
protein encoding sequence coding for more than one
protein and a 3'-terminator region which includes a
polyadenylation signal, each of the said protein
encoding sequences being separated from an adjacent
protein encoding sequence by a DNA sequence which
on translation provides a cleavage site whereby the
expressed polyprotein is post-translationally
processed into the component protein molecules.
Preferably the DNA sequence which encodes the
post-translation cleavage site is derived from a virus,
particularly a picornavirus such as a
Foot-and-Mouth Disease (FMD) virus. Thus multiple genes are
inserted into a plant genome under the control of a
single promoter, in the form of a self-processing
polyprotein. The inclusion of proteinase or
cleavage sequences in plant transformation
constructs enables the expression from a single
promoter of multiple introduced proteins, initially
linked as a polyprotein, in plant cells and plants.

In work leading to the present invention, we 
have shown that the four Ib-AMP proteins are 
encoded by a single gene (SEQ ID NO 8, Example 8).
The Ib-AMP gene sequence has an open reading frame
of 333 amino acids (a polyprotein) containing six
homologous repeats encoding all four of the
isolated Ib-AMPs. The gene also contains seven
propeptide domains which are removed during
processing of the precursor protein: five of these
domains (SEQ ID NO 14 to SEQ ID NO 18) are spacers
which lie between the AMP-encoding regions; one of
the domains (SEQ ID NO 20) lies at the C-terminus
of the protein; one of the domains (SEQ ID NO 19)
lies at the N-terminus of the protein and is linked
to a signal sequence (SEQ ID NO 21) of
approximately 25 amino acids. The structure of the
Ib-AMP gene is shown in Figure 10 and described in
Example 8. In plants only two examples are known
of multidomain precursors which are cleaved into
near identical parts, namely polyubiquitin
(Christensen et al, Plant Mol Biol, 18:675-689) and
a proteinase inhibitor from Nicotiana alata
(Atkinson et al, Plant Cell, 5:203-213).
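
The domain arrangement described above can be sketched as a simple ordered list. This is an illustration only: it assumes the signal sequence precedes the N-terminal propeptide and that the five spacers occur between the six repeats in ascending SEQ ID NO order, neither of which is stated in this passage (the actual arrangement is given in Figure 10 and Example 8). The function name is hypothetical.

```python
def ib_amp_precursor_layout() -> list[str]:
    """Return an assumed N- to C-terminal ordering of the Ib-AMP precursor domains."""
    spacer_ids = [14, 15, 16, 17, 18]  # assumed order of the five spacer propeptides
    n_repeats = 6                      # six homologous AMP-encoding repeats
    layout = [
        "signal peptide (SEQ ID NO 21)",
        "N-terminal propeptide (SEQ ID NO 19)",
    ]
    for i in range(n_repeats):
        layout.append(f"AMP repeat {i + 1}")
        # A spacer sits between consecutive repeats, but not after the last one.
        if i < n_repeats - 1:
            layout.append(f"spacer propeptide (SEQ ID NO {spacer_ids[i]})")
    layout.append("C-terminal propeptide (SEQ ID NO 20)")
    return layout
```

Under these assumptions the list holds fourteen elements: one signal peptide, six repeats and seven propeptide domains (five spacers plus the N- and C-terminal domains), matching the counts given in the text.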

According to a further aspect of the
invention, there is provided a method for the
expression of multiple proteins in a transgenic
plant comprising inserting into the genome of the
plant a gene construct comprising a 5'-region which
includes a promoter which is capable of initiating
transcription of a structural gene under the
control thereof, at least two protein encoding
sequences and a 3'-terminator region which includes
a polyadenylation signal, each of the said protein
encoding sequences being separated from an adjacent
protein encoding sequence by a DNA sequence which
on translation provides a cleavage site whereby the
expressed polyprotein is post-translationally
processed into the component protein molecules,
wherein at least one of the DNA sequences providing
the cleavage site encodes an amino acid sequence
selected from the group consisting of SEQ ID NO 14,
SEQ ID NO 15, SEQ ID NO 16, SEQ ID NO 17 and SEQ ID
NO 18. It is well known that variations may be 
made in amino acid sequences which do not greatly 
affect function and it is intended that such 
variants of the said sequence and the nucleotide 
which encodes it are within the scope of this 
invention. The DNA sequences providing the 
cleavage site may be derived from SEQ ID NO 8. 

Insertion of any one of the five Ib-AMP spacer 
propeptide domains (SEQ ID NO 14 to SEQ ID NO 18) 
to link protein encoding sequences allows the 
engineering of plant expression vectors where 
multiple whole proteins or protein domains can be 
expressed as a polyprotein and cleaved apart
co-translationally with high efficiency. Use of the
plant-derived spacer sequences facilitates
processing of the polyprotein within transgenic
plant tissue. The other propeptide domains (SEQ ID
NO 19, SEQ ID NO 20) and the signal peptide (SEQ ID
NO 21) from the Ib-AMP gene may also be
incorporated into the plant expression vector. 

Thus the Ib-AMP gene arrangement may be used 
in the expression of other peptides or proteins 
(including other antimicrobial peptides) by using 
the conserved spacer propeptide domains to 
construct an artificial gene encoding multimers of 
a given protein. The protein components of the 
polyprotein may be identical, thus increasing 
expression of said protein through a type of
"gene-dosage" effect. Alternatively, two or more
different protein components may be linked in one
polyprotein, enabling co-ordinated expression of
the different proteins. For example, this may
enable the rapid introduction of entire enzyme
cascades into plants.

The invention will now be described by way of
example only with reference to the drawings, in
which:

Figure 1 shows the cation exchange
chromatogram for purification of the Ib-AMPs.

Figure 2 shows the reversed phase chromatogram
for purified Ib-AMP1.

Figure 3 shows the reversed phase chromatogram
for purified Ib-AMP2.

Figure 4 shows the reversed phase chromatogram
for purified Ib-AMP3.

Figure 5 shows the reversed phase chromatogram
for purified Ib-AMP4.

Figure 6 shows the cation exchange
chromatogram for purification of the Arc-AMPs.

Figure 7 shows the reversed phase chromatogram
for purified Arc-AMP1.

Figure 8 shows the reversed phase chromatogram
for purified Arc-AMP2.

Figure 9 shows the amino acid sequences of
peptide fragments of the Ib-AMPs.

Figure 10 shows the sequence of the Ib-AMP
cDNA and the encoded protein.

Figure 11 is a diagram showing the structure
of the vector pIB6.

Figure 12 is a diagram showing the structure
of the vector pBinIB6.

The invention is also described with reference 
to the SEQUENCE LISTING in which: 

SEQ ID NO 1 is the partial amino acid sequence
of Ib-AMP1 shown in Figure 9;

SEQ ID NO 2 is the partial amino acid sequence
of Ib-AMP2 shown in Figure 9;

SEQ ID NO 3 is the partial amino acid sequence
of Ib-AMP3 shown in Figure 9;

SEQ ID NO 4 is the partial amino acid sequence
of Ib-AMP4 shown in Figure 9;

SEQ ID NO 5 is the amino acid sequence of a
region of the Ib-AMP sequences;

SEQ ID NO 6 is the nucleotide sequence of the 
oligonucleotide IbAMPl-C; 

SEQ ID NO 7 is the nucleotide sequence of the 
oligonucleotide IbAMPl-B; 

SEQ ID NO 8 is the nucleotide sequence of the 
Ib-AMP cDNA as shown in Figure 10; 

SEQ ID NO 9 is the predicted amino acid
sequence of the protein encoded by the Ib-AMP cDNA
as shown in Figure 10;

SEQ ID NO 10 is the complete amino acid
sequence of Ib-AMP1;

SEQ ID NO 11 is the complete amino acid
sequence of Ib-AMP2;

SEQ ID NO 12 is the complete amino acid
sequence of Ib-AMP3;

SEQ ID NO 13 is the complete amino acid
sequence of Ib-AMP4;

SEQ ID NO 14 is the amino acid sequence of an 
Ib-AMP propeptide spacer domain; 

SEQ ID NO 15 is the amino acid sequence of an 
Ib-AMP propeptide spacer domain; 

SEQ ID NO 16 is the amino acid sequence of an 
Ib-AMP propeptide spacer domain; 

SEQ ID NO 17 is the amino acid sequence of an 
Ib-AMP propeptide spacer domain; 

SEQ ID NO 18 is the amino acid sequence of an 
Ib-AMP propeptide spacer domain; 

SEQ ID NO 19 is the amino acid sequence of the 
Ib-AMP N-terminal propeptide spacer domain; 

SEQ ID NO 20 is the amino acid sequence of the 
Ib-AMP C-terminal propeptide spacer domain;

SEQ ID NO 21 is the amino acid sequence of the 
Ib-AMP signal peptide. 



EXAMPLE 1 

Antifungal and antibacterial activity assays. 

Antifungal activity was measured by
microspectrophotometry as previously described
(Broekaert et al, 1990, FEMS Microbiol Lett, 69:55-60).
Routinely, tests were performed with 20 µl of a
(filter-sterilized) test solution and 80 µl of a
suspension of fungal spores (2 x 10^4 spores/ml) in
Medium A (half strength potato dextrose broth or
1/2 PDB) or Medium B (1/2 PDB supplemented with 1 mM
CaCl2 and 50 mM KCl). Control microcultures
contained 20 µl of sterile distilled water and 80
µl of the fungal spore suspension.

Unless otherwise stated the test organism was
Fusarium culmorum (strain IMI 180420) and
incubation was done at 25°C for 48 hours. Percent
growth inhibition is defined as 100 times the ratio
of the corrected absorbance of the control
microculture minus the corrected absorbance of the
test microculture over the corrected absorbance at
595 nm of the control microculture. The corrected
absorbance values equal the absorbance at 595 nm of
the culture measured after 48 hours minus the
absorbance at 595 nm measured after 30 min.
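
The definition of percent growth inhibition given above can be written out as a short calculation. The sketch below follows the stated formula directly; the function and parameter names are illustrative and the absorbance readings in the usage note are invented for the example, not measured data.

```python
def corrected_absorbance(a595_end: float, a595_30min: float) -> float:
    """Absorbance at 595 nm after incubation minus the reading taken after 30 min."""
    return a595_end - a595_30min


def percent_growth_inhibition(control_end: float, control_30min: float,
                              test_end: float, test_30min: float) -> float:
    """100 x (corrected control - corrected test) / corrected control."""
    control = corrected_absorbance(control_end, control_30min)
    test = corrected_absorbance(test_end, test_30min)
    if control <= 0:
        # Without growth in the control microculture the ratio is meaningless.
        raise ValueError("control microculture shows no growth")
    return 100.0 * (control - test) / control
```

For instance, a control rising from 0.05 to 0.85 and a test culture rising from 0.05 to 0.25 would correspond to 75% growth inhibition.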

Antibacterial activity was measured 
microspectrophotometrically as follows. A 
bacterial suspension was prepared by inoculating 
soft nutrient agarose (tryptone, 10 g/l; Seaplaque
agarose (FMC), 5 g/l). Aliquots (80 µl) of the
bacterial suspension (10^5 colony forming units per
ml) were added to filter-sterilized samples (20 µl)
in flat-bottom 96-well microplates. The absorbance 
at 595 nm of the culture was measured with the aid 
of a microplate reader after 30 minutes and 24 
hours of incubation at 28°C. Percent growth 
inhibition was calculated as described above for 
the antifungal activity assay. 



EXAMPLE 2 

Purification of antimicrobial proteins from
Impatiens balsamina seeds.

Five hundred grammes of I. balsamina seeds
(purchased from Chiltern Seeds, Cumbria, UK) was
ground in a coffee mill and the resulting meal was
extracted for 2 hours at 4°C with 2 litres of an
ice-cold extraction buffer containing 10 mM
NaH2PO4, 15 mM Na2HPO4, 100 mM KCl, 2 mM EDTA and 1
mM benzamidine. The resulting homogenate was
squeezed through cheesecloth and clarified by
centrifugation (30 min at 7,000 x g). Solid
ammonium sulphate was added to the supernatant to
obtain 75% relative saturation and the precipitate
allowed to form by standing overnight at 4°C.
Following centrifugation at 7,000 x g for 30
minutes, the precipitate was redissolved in a
minimal volume of distilled water and dialyzed
extensively against distilled water using
benzoylated cellulose tubing (Sigma, St Louis, MO).
After dialysis the solution was adjusted
to 50 mM NH4Ac (pH 9) by addition of the ten-fold
concentrated buffer, and subsequently passed over a
Q-Sepharose Fast Flow (Pharmacia, Uppsala, Sweden)
column (12 x 5 cm) in equilibrium with 50 mM NH4Ac
(pH 9). The basic protein fraction which passed
through the column was adjusted to pH 6 with acetic
acid and further purified by cation exchange
chromatography as described below.

Approximately 500 ml of the basic protein 
fraction was applied on a S-Sepharose High 
Performance (Pharmacia) column (10 x 1.6 cm) 
previously equilibrated with 50 mM NH4Ac buffer (pH
6.0). The column was eluted at 3 ml/min with a
linear gradient of 50-750 mM NH4Ac (pH 6) over 325
minutes. The eluate was monitored for protein by
online measurement of the absorbance at 280 nm and
collected in 10 ml fractions. Samples from each 
fraction were assayed for antifungal activity as 
described in Example 1. Results are shown in 
Figure 1, with the active peaks shaded black. 
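
As a worked illustration of the gradient described above, the nominal buffer concentration at a given elution volume or fraction can be estimated from the stated flow rate, gradient limits and fraction size. This sketch assumes an ideal linear gradient starting at zero elution volume and ignores any delay or dead volume in the system, which the text does not specify; the function names are illustrative.

```python
def gradient_concentration_mM(volume_ml: float,
                              start_mM: float = 50.0,
                              end_mM: float = 750.0,
                              flow_ml_per_min: float = 3.0,
                              duration_min: float = 325.0) -> float:
    """Nominal NH4Ac concentration at a given elution volume of a linear gradient."""
    total_volume_ml = flow_ml_per_min * duration_min  # 975 ml for the stated run
    # Clamp to the gradient limits outside the programmed range.
    progress = min(max(volume_ml / total_volume_ml, 0.0), 1.0)
    return start_mM + (end_mM - start_mM) * progress


def fraction_midpoint_concentration_mM(fraction_number: int,
                                       fraction_ml: float = 10.0) -> float:
    """Nominal concentration at the midpoint of a 1-indexed collected fraction."""
    midpoint_ml = (fraction_number - 0.5) * fraction_ml
    return gradient_concentration_mM(midpoint_ml)
```

Under these assumptions the full gradient spans 975 ml, and its midpoint (487.5 ml) corresponds to a nominal 400 mM NH4Ac.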

Following chromatography, the extract yielded
four peaks of activity eluting at between 400 mM
and 700 mM NH4Ac. Fractions from each peak showing
antifungal activity were pooled and further
purified by reverse-phase HPLC. About 3 mg amounts
of each peak were loaded on a Pep-S (porous silica
C2/C18, Pharmacia) column (25 x 0.93 cm)
equilibrated with 0.1% TFA (trifluoroacetic acid).
The column was developed at 1 ml/min with a linear
gradient of 0.1% TFA to 100% acetonitrile/0.1% TFA
over 65 minutes. The eluate was monitored for
protein by online measurement of the absorption at
210 nm.



The results for peaks 1, 2, 3 and 4 are shown 
in Figures 2, 3, 4 and 5, respectively. One ml 
fractions were collected, vacuum dried, and finally 
dissolved in 0.5 ml distilled water. 10 µl from
each fraction was assayed for antifungal activity.
Peaks 1, 2 and 3 yielded single peaks of antifungal
activity (shaded black) which eluted at
approximately 20% acetonitrile. The active
fractions in peaks 1, 2 and 3 are designated
Ib-AMP1, Ib-AMP2 and Ib-AMP3 respectively. Peak 4
yielded two peaks of antifungal activity; the
second larger peak is designated Ib-AMP4 (shaded
black in Figure 5). The first peak probably
represents some carry over from peak 3 on the FPLC.



EXAMPLE 3

Purification of antimicrobial proteins from
Aralia chinensis seeds.

The basic protein fraction was extracted from
A. chinensis seeds (purchased from Sandeman Seeds,
Pulborough, Sussex, UK) using the method described
in Example 2. This protein fraction was then
further purified using the method described in
Example 2.

Following chromatography on the S-Sepharose
High Performance column, the Aralia extract yielded
two peaks of antifungal activity eluting at
approximately 400 mM (peak 1) and 500 mM (peak 2)
NH4Ac. Results are shown in Figure 6, with the two
active peaks shaded in black.

Active fractions were pooled for each peak and
further purified on reverse-phase HPLC as described
in Example 2. Results for peak 1 are shown in
Figure 7: it yielded an active factor eluting at
approximately 20% acetonitrile which is designated
Arc-AMP1. Similarly peak 2 eluted to a single peak
of activity which is designated Arc-AMP2 (results
shown in Figure 8).



EXAMPLE 4 
Molecular structure of the purified 
antimicrobial proteins 

The molecular structure of the purified 
antimicrobial proteins was further analysed by 
sodium dodecyl sulphate polyacrylamide gel 
electrophoresis (SDS-PAGE). SDS-PAGE was performed
on precast commercial gels (PhastGel High Density
from Pharmacia) using a PhastSystem (Pharmacia)
electrophoresis apparatus. The sample buffer
contained 200 mM Tris-HCl (pH 8.3), 1% (w/v) SDS, 1
mM EDTA, 0.005% bromophenol blue and, unless
otherwise stated, 1% (w/v) dithioerythritol (DTE).
Proteins were visualised after diffusion blotting
to nitrocellulose followed by silver staining.
Molecular weight markers (Pharmacia) and purified
Mj-AMP2 (4 kDa protein from Mirabilis jalapa seeds)
were run for comparison.

Ib-AMPl and Arc-AMPl were analysed by 
SDS-PAGE. Both proteins run as approximately 3 kDa 
bands when run reduced and also when run 
non-reduced. The results show that the peptides 
are single chain polypeptides. 



EXAMPLE 5 

Antifungal potency of the antimicrobial 
proteins 

The antifungal potency of the purified 
proteins was assessed on different plant pathogenic 
fungi, using the assay described in Example 1. 
Growth of fungi and collection and harvest of fungal
spores were done as previously described (Broekaert
et al, 1990, FEMS Microbiol Lett, 69:55-60).
The following fungal strains were used: Alternaria
longipes CBS62083, Bipolaris maydis HM-10, Botrytis
cinerea MUCL 30158, Cercospora beticola strain
K897, Colletotrichum graminicola CG-17,
Cladosporium sphaerospermum K0791, Fusarium
culmorum IMI 180420, Fusarium graminearum FR-12,
Fusarium moniliforme FM-9, Penicillium digitatum
(K0879), Sphacelotheca reiliana HS, Septoria
tritici (K1097D), Stenocarpella maydis, Trichoderma
viride K1127, Verticillium albo-atrum K0937,
Verticillium dahliae MUCL 19210.

Serial dilutions of the antifungal proteins
were applied to the fungi, using either growth
medium A (half strength potato dextrose broth, 1/2
PDB) or medium B (medium A supplemented with 1 mM
CaCl2 and 50 mM KCl). The percent growth
inhibition was measured by microspectrophotometry.
The concentration required for 50% growth
inhibition after 48 h of incubation (IC50 value)
was calculated from the dose-response curves.
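
As an illustration of how an IC50 value can be read off a dose-response curve, the following Python sketch interpolates linearly on log-concentration between the two dilutions that bracket 50% inhibition. The function name, the interpolation scheme and the example readings are assumptions made for illustration only; the text does not state how the curves were fitted.

```python
import math

def ic50_from_curve(concentrations, inhibition_percent):
    """Estimate the concentration giving 50% growth inhibition.

    Assumes inhibition rises with concentration and interpolates
    linearly on log10(concentration) between the bracketing points.
    Returns None if 50% inhibition is never bracketed.
    """
    pairs = sorted(zip(concentrations, inhibition_percent))
    for (c_lo, y_lo), (c_hi, y_hi) in zip(pairs, pairs[1:]):
        if y_lo <= 50.0 <= y_hi and y_hi > y_lo:
            frac = (50.0 - y_lo) / (y_hi - y_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None

# Hypothetical two-fold dilution series (ug/ml) and readings (% inhibition).
doses = [0.78, 1.56, 3.12, 6.25, 12.5, 25.0]
inhibition = [5.0, 20.0, 40.0, 60.0, 85.0, 98.0]
ic50 = ic50_from_curve(doses, inhibition)
```

With these made-up readings the 50% point falls between the 3.12 and 6.25 ug/ml dilutions. A curve that never reaches 50% inhibition returns None, which corresponds to the ">200" style entries in the tables.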

The results for Ib-AMP1, Ib-AMP2, Ib-AMP3 and
Ib-AMP4 are summarised in Table 1. The results for
Arc-AMP1 and Arc-AMP2 are summarised in Table 2.
All six peptides show broad spectrum activity
against the pathogens tested. In the low-ionic
strength medium (medium A) the IC50 values are
generally below 10 µg/ml. The activity of the
peptides is sensitive to the ionic conditions used
in the assay and in high salt medium (medium B)
their activity is reduced. However, even in medium
B, the most basic of the Ib-AMP peptides (Ib-AMP4)
still exhibits fairly strong activity on some of
the fungi tested.

TABLE 1

ANTIFUNGAL ACTIVITY OF THE Ib-AMPs

                              IC50 (µg/ml)
FUNGUS             Ib-AMP1   Ib-AMP2   Ib-AMP3   Ib-AMP4

MEDIUM A
A longipes         3         12        6         3
B cinerea          12        25        6         6
B maydis           7         nd        nd        nd
C beticola         1.5       3         nd        nd
C graminicola      2         nd        nd        nd
C sphaerospermum   1.5       6         3         1
F culmorum         1.5       6         6         1
F graminearum      3         nd        nd        nd
F moniliforme      20        nd        nd        nd
P digitatum        3         6         3         3
S maydis           5         nd        nd        nd
S reiliana         4         nd        nd        nd
S tritici          1         nd        nd        1
T viride           6         12        12        6
V albo-atrum       3         12        6         6
V dahliae          1         nd        nd        nd

MEDIUM B
A longipes         50        >200      >200      12
B cinerea          >200      >200      >200      200
C sphaerospermum   50        >200      100       6
F culmorum         50        >200      100       6
P digitatum        200       >200      100       25
S tritici          50        nd        nd        12
T viride           >200      >200      >200      150
V albo-atrum       >200      >200      >200      50

nd = not determined

TABLE 2

ANTIFUNGAL ACTIVITY OF Arc-AMP1 AND Arc-AMP2

                              IC50 (µg/ml)
                   Medium A              Medium B
Fungus             Arc-AMP1  Arc-AMP2    Arc-AMP1  Arc-AMP2

B cinerea          10        8           >100      >100
C sphaerospermum   1.5       1           100       100
F culmorum         1         1           100       100
P digitatum        2         3           >100      >100
V dahliae          1         nd          50        nd

nd = not determined



EXAMPLE 6 

Anti-bacterial and anti-yeast activity of
Ib-AMP1 and Arc-AMP1

The purified proteins were assessed for their
effect on the growth of the following bacteria:
Bacillus megaterium ATCC 13632 and Escherichia coli
strain HB101. The proteins were also assessed for
their effect on the growth of Saccharomyces
cerevisiae JRY188 and Candida albicans KA-1.
Bioassays were carried out as described in Example
1. The results are summarised in Table 3. Both
proteins strongly inhibited the growth of B
megaterium and S cerevisiae but had little or no
effect on the growth of E coli. Ib-AMP1 also
strongly inhibited the growth of C albicans.

TABLE 3

Activity on bacteria and yeast

                   IC50 (µg/ml)
Organism           Ib-AMP1   Arc-AMP1

B megaterium       10        15
E coli             >800      >500
S cerevisiae       20        30
C albicans         10        nd

nd = not determined



EXAMPLE 7 
Amino acid sequencing of Ib-AMPs.

Cysteine residues were modified by
S-pyridylethylation using the method of Fullmer
(1984, Anal Biochem, 142, 336-341). Reagents were
removed by HPLC on a Pep-S (porous silica C2/C18)
(Pharmacia) column (25 x 0.4 cm). The
S-pyridylethylated proteins were
recovered by eluting the column with a linear
gradient from 0.1% trifluoroacetic acid (TFA) to
acetonitrile containing 0.1% TFA. The resulting
protein fractions were subjected to amino acid
sequence analysis in a 477A Protein Sequencer
(Applied Biosystems) with on-line detection of
phenylthiohydantoin amino acid derivatives in a
120A Analyser (Applied Biosystems).

Initial attempts to sequence the Ib-AMPs
showed that all four peptides were N-terminally
blocked. In order to obtain their sequence, each
of the peptides was digested with either trypsin or
chymotrypsin. The resulting peptide fragments
were purified by RP-HPLC and sequenced. In each
case one peptide fragment (the N-terminus of the
protein) was found to be blocked, preventing
complete sequencing of the Ib-AMP proteins.

Digestion of Ib-AMP1 with trypsin gave four
fragments from which the partial sequence of
Ib-AMP1 (SEQ ID NO 1) was obtained. Fragment Ib1T1
was 16 amino acids long and contained the majority
of Ib-AMP1 (as shown in FIGURE 9). The other three
peptide fragments represented further cleavage of
peptide Ib1T1. In addition, sequencing of a
peptide (Ib1C1) generated by digestion of Ib-AMP1
with chymotrypsin allowed two further amino acids
to be assigned to the N-terminus.

The partial sequences for Ib-AMP2 (SEQ ID NO
2), Ib-AMP3 (SEQ ID NO 3) and Ib-AMP4 (SEQ ID NO 4)
were assembled in a similar way from sequences of
peptide fragments generated by chymotrypsin
digestion (FIGURE 9). Only two fragments from
Ib-AMP3 were sequenced and so the sequence shown in
Figure 9 (SEQ ID NO 3) represents only the 11 amino
acids from the C-terminus of this peptide.

In order to estimate the full length of the
Ib-AMPs, the molecular weight of Ib-AMP1 was
determined by electrospray mass spectrometry and
found to be 2466 Da. The molecular weight of the
18 amino acids assigned by amino acid sequencing is
2172 Da, suggesting that the full-length peptide is
only 2 or 3 amino acids longer.
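
The arithmetic behind this estimate can be sketched in Python as follows. The residue masses are standard average values, the one-letter transcription of SEQ ID NO 1 is ours, and the figure of about 110 Da per residue is a common rule of thumb rather than a value given in the text. Cysteine modifications and disulphide bonds are ignored.

```python
# Average residue masses in Da (standard values; one water is added per chain).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.015

def peptide_mass(sequence):
    """Average mass of an unmodified linear peptide (one-letter codes)."""
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in sequence) + WATER

partial_ib_amp1 = "GRRCCGWGPGRRYCVRWC"    # SEQ ID NO 1 (18 residues)
measured_mass = 2466.0                    # electrospray value from the text
calculated_mass = peptide_mass(partial_ib_amp1)
unassigned_mass = measured_mass - calculated_mass
approx_missing_residues = unassigned_mass / 110.0   # rule-of-thumb residue mass
```

The calculated mass rounds to the 2172 Da quoted above, and the unassigned mass of just under 300 Da corresponds to between two and three average residues.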

All four Ib-AMP peptides are very close
homologues of each other with only a few amino acid
substitutions between them. Searches of protein
databases have failed to find any other proteins
with significant homology to the Ib-AMP sequences.
However, a small part of the Ib-AMP sequences, the
region GPGRRY (SEQ ID NO 5), has been found in a
number of proteins (including viral coat proteins)
and has been shown to be involved in forming a
β-turn.



EXAMPLE 8 

Molecular cloning of the Impatiens cDNA

Total RNA was extracted from dry Impatiens
balsamina seeds using the method of Jepson et al
(1991, Plant Mol Biol Reporter, 9(2)). From 30 g
seed, 5.9 mg of total RNA was recovered.
Approximately 3 mg of total RNA was used to purify
poly(A)+ mRNA using the PolyAtrack Kit
(Stratagene). This yielded approximately 30 µg of
mRNA, of which half was used for cDNA synthesis
using Stratagene's lambda ZAPII phage vector kit
according to the manufacturer's instructions.
Synthesised cDNA was size fractionated into three
fractions: up to 6 kb, up to 4 kb and up to 2 kb.

A DNA probe for screening the library was
produced by polymerase chain reaction (PCR) using
the synthesised cDNA fractions as template and two
degenerate oligos based on the available peptide
sequence for Ib-AMP1. The sequences of the PCR
primers were:

IbAMP1-C (5'-GITGT/CTGT/CCGITGGGGICC-3') (SEQ ID NO 6) and
IbAMP1-B (5'-CACCAICT/GIACG/ACAG/ATA-3') (SEQ ID NO 7).

A PCR product of 50 bp was purified by
polyacrylamide gel electrophoresis, random labelled
and used to probe the library. Approximately
160,000 plaques were probed and 15 positives were
obtained from this primary round of screening.
These 15 plaques were purified by two further
rounds of screening using the same DNA probe.
Inserts from the purified plaques were excised in
vivo into the pBluescript phagemid with the aid of
a helper phage (VCSM13). Inserts were removed by
digestion with XhoI and EcoRI and their sizes
compared on agarose gels. The sizes of the inserts
varied from approximately 600 bp to 1300 bp.
Fourteen clones were subjected to nucleotide
sequencing. Clone Ib22 was fully sequenced and
shown to contain a complete gene sequence with an
open reading frame of 333 amino acids containing
six homologous repeats encoding all four of the
isolated Ib-AMPs. Other clones were found to be
either identical to clone Ib22 or truncated
versions of the full-length gene.
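
The notation used for the degenerate primers above can be unpacked with a short Python sketch. It assumes that "X/Y" means either base at a single position and that "I" is inosine, a single base that is not counted towards degeneracy. The function names are hypothetical, and chained alternatives such as "A/C/G" are not handled.

```python
from itertools import product

def parse_primer(notation):
    """Turn 'TGT/CTG'-style notation into a list of per-position base tuples."""
    positions = []
    i = 0
    while i < len(notation):
        if notation[i + 1:i + 2] == "/" and i + 2 < len(notation):
            # Two alternative bases share this position.
            positions.append((notation[i], notation[i + 2]))
            i += 3
        else:
            positions.append((notation[i],))
            i += 1
    return positions

def degeneracy(positions):
    """Number of distinct oligonucleotides in the primer mixture."""
    count = 1
    for alternatives in positions:
        count *= len(alternatives)
    return count

def expand(positions):
    """List every oligonucleotide in the mixture."""
    return ["".join(choice) for choice in product(*positions)]

primer_c = parse_primer("GITGT/CTGT/CCGITGGGGICC")   # IbAMP1-C as printed
primer_b = parse_primer("CACCAICT/GIACG/ACAG/ATA")   # IbAMP1-B as printed
```

Read this way, IbAMP1-C has two ambiguous positions and IbAMP1-B has three, so the mixtures contain four and eight oligonucleotides respectively.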

Figure 10 shows the nucleotide sequence (SEQ
ID NO 8) of the Ib-AMP cDNA (clone Ib22) and the
predicted amino acid sequence (SEQ ID NO 9) of the
encoded protein, which contains the repeated
sequences (underlined in Figure 10). By comparing
this predicted sequence to the Ib-AMP sequences
determined by direct peptide sequencing, the
complete Ib-AMP sequences were identified. It was
found that the first repeat encodes Ib-AMP3 (SEQ ID
NO 12); the second, third and fourth repeats each
encode Ib-AMP1 (SEQ ID NO 10); the fifth repeat
encodes Ib-AMP2 (SEQ ID NO 11); and the sixth
repeat encodes Ib-AMP4 (SEQ ID NO 13).
Thus the Ib-AMP gene contains three repeats of
Ib-AMP1 and one each of Ib-AMP2, Ib-AMP3 and
Ib-AMP4. The gene also contains a predicted signal
sequence of approximately 25 amino acids and seven
propeptide domains which are removed during
processing of the precursor protein. Thus the
structure of the polyprotein encoded by the Ib-AMP
gene (Figure 10) is as follows:
N-terminus --signal peptide (SEQ ID NO 21)--
--propeptide domain (SEQ ID NO 19)--
--Ib-AMP3 encoding region (SEQ ID NO 12)--
--propeptide domain (SEQ ID NO 14)--
--Ib-AMP1 encoding region (SEQ ID NO 10)--
--propeptide domain (SEQ ID NO 15)--
--Ib-AMP1 encoding region (SEQ ID NO 10)--
--propeptide domain (SEQ ID NO 16)--
--Ib-AMP1 encoding region (SEQ ID NO 10)--
--propeptide domain (SEQ ID NO 17)--
--Ib-AMP2 encoding region (SEQ ID NO 11)--
--propeptide domain (SEQ ID NO 18)--
--Ib-AMP4 encoding region (SEQ ID NO 13)--
--propeptide domain (SEQ ID NO 20)-- C-terminus.
Five of the propeptide domains (SEQ ID NO 14 to SEQ 
ID NO 18) are "spacers" separating two Ib-AMP 
encoding regions. These spacers are cleaved at 
either end during post-translational processing of 
the polyprotein. The N-terminal propeptide domain 
(SEQ ID NO 19) is cleaved from the signal peptide 
(SEQ ID NO 21) at one end and from the Ib-AMP3 
encoding region at the other end. The C-terminal 
propeptide region (SEQ ID NO 20) is cleaved from 
the Ib-AMP4 encoding region at one end. 
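
The domain layout described above can be checked against the precursor sequence with a short Python sketch. The one-letter string below is our transcription of SEQ ID NO 9 from the sequence listing, and the domain lengths are those stated for SEQ ID NO 10 to SEQ ID NO 21. The function name and data layout are assumptions for illustration.

```python
# SEQ ID NO 9 in one-letter code, 16 residues per chunk as in the listing.
PRECURSOR = (
    "MVQKGVVFGVLLILFI" "CSTLTSADSKPNPTKE" "EEPAKKPDEVSVKSGG"
    "PEVSEDQYRHRCCAWG" "PGRKYCKRWCANAEEA" "AAAIPEASEELAQEEA"
    "PVYSEDQWGRRCCGWG" "PGRRYCVRWCQNAEEA" "AAAIPEATEKAQEAPV"
    "YSEDQWGRRCCGWGPG" "RRYCVRWCQNAEEAAA" "AVAIPEASEKAQEGPV"
    "YSEDQWGRRCCGWGPG" "RRYCVRWCSNAADEVA" "TPEDVEPGQYGRRCCN"
    "WGPGRRYCKRWCHNAA" "EEATLKAFEEEAAREQ" "PVYSEDQWGRRCCGWG"
    "PGRRYCRRWCQSAEEA" "AAFQAGEVTASLMLIM" "FKACPCMGPVPSV"
)

# (domain name, length in residues), N-terminus to C-terminus.
DOMAINS = [
    ("signal peptide", 25), ("propeptide 19", 29),
    ("Ib-AMP3", 20), ("propeptide 14", 28),
    ("Ib-AMP1", 20), ("propeptide 15", 26),
    ("Ib-AMP1", 20), ("propeptide 16", 28),
    ("Ib-AMP1", 20), ("propeptide 17", 16),
    ("Ib-AMP2", 20), ("propeptide 18", 26),
    ("Ib-AMP4", 20), ("propeptide 20", 35),
]

def split_domains(sequence, domains):
    """Cut the precursor into consecutive domains of the given lengths."""
    parts, position = [], 0
    for name, length in domains:
        parts.append((name, sequence[position:position + length]))
        position += length
    if position != len(sequence):
        raise ValueError("domain lengths do not cover the whole sequence")
    return parts

parts = split_domains(PRECURSOR, DOMAINS)
mature_peptides = [(name, seq) for name, seq in parts if name.startswith("Ib-AMP")]
```

The domain lengths sum to 333 residues, matching the open reading frame reported in Example 8, and the six excised segments correspond to one copy each of Ib-AMP3, Ib-AMP2 and Ib-AMP4 and three copies of Ib-AMP1.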
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EXAMPLE 9 

Construction of the plant expression vector 
pIB6 containing the Ib-AMP cDNA 

The plant expression vector pIB6 was
constructed using the pUC based vector pMJB1 which
carries an enhanced 35S promoter, TMV leader
sequence and the Nos terminator. An NcoI site was
introduced using PCR at the start of the open
reading frame in the Ib-AMP cDNA and the Ib-AMP
cDNA subsequently cloned into pMJB1 at the NcoI and
SmaI sites to create pIB6 (FIGURE 11).



EXAMPLE 10 

Construction of plant transformation vector 
pBinIB6 

The expression vector pIB6 was digested with
HindIII and EcoRI and the fragment containing the
Ib-AMP expression cassette was subcloned into
pBin19Ri. pBin19Ri is a modified version of the
plant transformation vector pBin19 (Bevan 1984,
Nucleic Acids Research 12, 8711-8721) wherein the
unique EcoRI and HindIII sites are switched and the
defective nptII expression cassette (Yenofsky et
al, 1990, Proc. Natl. Acad. Sci. USA 87, 3435-3439)
is introduced. The new plant transformation vector
is designated pBinIB6 (FIGURE 12).

EXAMPLE 11
Plant Transformation

The disarmed Agrobacterium tumefaciens strain
LBA4404 (pAL4404) (Hoekema et al, 1983, Nature 303,
179-180) was transformed with the vector pBinIB6
using the method of de Framond et al (Biotechnology
1, 262-269). Tobacco transformation was carried
out using leaf discs of Nicotiana tabacum Samsun
and co-culturing with Agrobacterium containing
pBinIB6. A selection pressure of 100 µg/ml
kanamycin was present during the co-cultivation.
Transgenic plants transformed with pBinIB6 were
regenerated on media containing 100 µg/ml
kanamycin.

Transgenic plants are analysed for expression
of the introduced gene using standard western
blotting techniques and plants capable of
constitutive expression are selected and
self-pollinated to give seed. F1 seedlings of
transgenic plants are further analysed and
progressed to select plants homozygous for the
Ib-AMP gene.



EXAMPLE 12 

Making a DNA construct encoding a polyprotein

An artificial gene is constructed using the
sequence encoding the antimicrobial protein
Rs-AFP2 (International Patent Application
Publication Number WO93/05153). The artificial
gene comprises three copies of the Rs-AFP2 encoding
sequence linked by two spacer regions, each spacer
region encoding a propeptide linker having the
sequence of any one of SEQ ID NO 14 to SEQ ID NO
18.

The artificial gene is constructed from the
Rs-AFP2 sequence by introducing a suitable
restriction enzyme site at the 3' end of the mature
protein sequence, into which oligonucleotides
encoding the linker peptide and further copies of
the DNA encoding the 51 amino acid Rs-AFP2 peptide
sequence are introduced. Each subsequent copy of
the Rs-AFP2 peptide sequence is linked to the
previous one using the linker peptide sequence.
The final copy of the Rs-AFP2 encoding sequence is
followed by a stop codon. The first copy of the
Rs-AFP2 encoding sequence may be preceded by a
signal sequence.
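
At the protein level, the layout of such a construct can be sketched in Python as below. This is only an illustration of the copy-linker-copy arrangement: the function name is hypothetical, the linker is our one-letter transcription of SEQ ID NO 17, and the mature peptide in the usage line is a placeholder because the Rs-AFP2 sequence is not reproduced in this document.

```python
LINKER_SEQ_ID_NO_17 = "SNAADEVATPEDVEPG"   # propeptide spacer, 16 residues

def build_polyprotein(mature, copies=3, linker=LINKER_SEQ_ID_NO_17, signal=""):
    """Join `copies` of a mature peptide with a linker between adjacent copies.

    An optional signal sequence precedes the first copy; nothing follows the
    last copy, mirroring the stop codon placed after the final repeat.
    """
    if copies < 1:
        raise ValueError("at least one copy is required")
    return signal + linker.join([mature] * copies)

# Placeholder peptide, not the real Rs-AFP2 sequence.
example = build_polyprotein("PEPTIDE", copies=3)
```

Three copies require two linkers, matching the "three copies ... linked by two spacer regions" arrangement described above.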

The artificial gene is then used to construct
a plant expression construct suitable for plant
transformation. A plant-operative promoter drives
expression of the artificial gene so that the
polyprotein is produced and subsequently processed
to release three copies of the Rs-AFP2 protein.

SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT:
(A) NAME: ZENECA Limited
(B) STREET: 15 Stanhope Gate
(C) CITY: London
(D) STATE: England
(E) COUNTRY: UK
(F) POSTAL CODE (ZIP): W1Y 6LN
(ii) TITLE OF INVENTION: ANTIMICROBIAL PROTEINS
(iii) NUMBER OF SEQUENCES: 21
(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)
(vi) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: GB 9404807.1
(B) FILING DATE: 11-MAR-1994

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(vi) ORIGINAL SOURCE:
(A) ORGANISM: FIGURE 9, IB-AMP1
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Gly Arg Arg Cys Cys Gly Trp Gly Pro Gly Arg Arg Tyr Cys Val Arg
1 5 10 15

Trp Cys

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(vi) ORIGINAL SOURCE:
(A) ORGANISM: FIGURE 9: IB-AMP2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Gly Arg Arg Cys Cys Asn Trp Gly Pro Gly Arg Arg Tyr Cys Lys Arg
1 5 10 15

Trp Cys

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 11 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(vi) ORIGINAL SOURCE:
(A) ORGANISM: FIGURE 9: IB-AMP3
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Gly Pro Gly Arg Lys Tyr Cys Lys Arg Trp Cys
1 5 10

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(vi) ORIGINAL SOURCE:
(A) ORGANISM: FIGURE 9: IB-AMP4
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Gly Arg Arg Cys Cys Gly Trp Gly Pro Gly Arg Arg Tyr Cys Arg Arg
1 5 10 15

Trp Cys

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Gly Pro Gly Arg Arg Tyr
1 5

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: IB-AMP1-C
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GNTGTCTGTC CGNTGGGGNC C

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: IB-AMP1-B
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CACCANCTGN ACGACAGATA

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1230 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: FIGURE 10
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

ATTTTTAGGT GAGGAAAAAT GGTCCAAAAA GGTGTAGTCT TTGGGGTGCT CCTAATTCTC 60
TTCATCTGCT CTACGCTCAC TTCGGCCGAT TCGAAGCCAA ACCCTACGAA AGAGGAAGAA 120
CCAGCGAAGA AACCGGATGA GGTCAGCGTA AAGAGCGGTG GACCGGAGGT GTCGGAGGAT 180
CAATACCGTC ATCGGTGCTG CGCTTGGGGA CCTGGGCGAA AATATTGCAA GCGGTGGTGT 240
GCTAACGCTG AAGAGGCGGC GGCCGCAATC CCCGAGGCAA GTGAAGAATT AGCTCAGGAG 300
GAGGCTCCGG TGTACTCGGA GGATCAGTGG GGTCGTCGGT GCTGCGGCTG GGGACCCGGC 360
CGAAGATACT GCGTGCGCTG GTGTCAAAAC GCGGAAGAGG CGGCCGCGGC AATCCCCGAG 420
GCGACTGAAA AAGCTCAGGA GGCTCCGGTG TACTCGGAGG ATCAGTGGGG TCGTCGATGC 480
TGCGGCTGGG GACCCGGCCG ACGGTATTGC GTGCGCTGGT GTCAAAACGC GGAAGAGGCG 540
GCCGCGGCGG TGGCAATCCC CGAGGCAAGT GAGAAAGCTC AGGAGGGACC CGTGTACTCG 600
GAGGATCAGT GGGGTCGCCG ATGCTGCGGT TGGGGACCTG GCCGTAGGTA TTGCGTGCGG 660
TGGTGCAGCA ACGCCGCCGA CGAGGTGGCA ACACCCGAGG ACGTAGAACC GGGTCAGTAC 720
GGTCGTCGGT GCTGCAACTG GGGACCTGGG CGAAGGTATT GCAAGCGGTG GTGTCATAAT 780
GCGGCTGAAG AGGCAACTCT CAAGGCATTT GAAGAGGAAG CAGCTCGGGA GCAACCGGTG 840
TACTCGGAGG ACCAGTGGGG TCGCCGGTGC TGCGGTTGGG GACCCGGCCG TAGGTACTGC 900
AGGCGGTGGT GTCAAAGCGC CGAAGAAGCG GCTGCGTTCC AGGCTGGGGA GGTAACTGCT 960
TCCTTGATGC TCATCATGTT TAAGGCATGC CCATGCATGG GGCCGGTGCC TTCTGTTTAA 1020
GGCCACTCTA GCTAGCTACG TACTCTTAAT AAGGGCACAT GAAAAAGTTT GTCCTTTAGA 1080
AATAAGGCAC AGTAAGAAAT AAAATGTCCA ACTTCTTTTA TGAAAGAAGT GAACAATAAG 1140
TGTAAGCTGA ATAATATATA TTGTGACACG TTTGTTGTTG TACAAAAATA ACATCTTTTC 1200
AGATGAACAA CCTTTAATGG AAAAAAAAAA 1230

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 333 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(vi) ORIGINAL SOURCE:
(A) ORGANISM: FIGURE 10
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Val Gln Lys Gly Val Val Phe Gly Val Leu Leu Ile Leu Phe Ile
1 5 10 15

Cys Ser Thr Leu Thr Ser Ala Asp Ser Lys Pro Asn Pro Thr Lys Glu
20 25 30

Glu Glu Pro Ala Lys Lys Pro Asp Glu Val Ser Val Lys Ser Gly Gly
35 40 45

Pro Glu Val Ser Glu Asp Gln Tyr Arg His Arg Cys Cys Ala Trp Gly
50 55 60

Pro Gly Arg Lys Tyr Cys Lys Arg Trp Cys Ala Asn Ala Glu Glu Ala
65 70 75 80

Ala Ala Ala Ile Pro Glu Ala Ser Glu Glu Leu Ala Gln Glu Glu Ala
85 90 95

Pro Val Tyr Ser Glu Asp Gln Trp Gly Arg Arg Cys Cys Gly Trp Gly
100 105 110

Pro Gly Arg Arg Tyr Cys Val Arg Trp Cys Gln Asn Ala Glu Glu Ala
115 120 125

Ala Ala Ala Ile Pro Glu Ala Thr Glu Lys Ala Gln Glu Ala Pro Val
130 135 140

Tyr Ser Glu Asp Gln Trp Gly Arg Arg Cys Cys Gly Trp Gly Pro Gly
145 150 155 160

Arg Arg Tyr Cys Val Arg Trp Cys Gln Asn Ala Glu Glu Ala Ala Ala
165 170 175

Ala Val Ala Ile Pro Glu Ala Ser Glu Lys Ala Gln Glu Gly Pro Val
180 185 190

Tyr Ser Glu Asp Gln Trp Gly Arg Arg Cys Cys Gly Trp Gly Pro Gly
195 200 205

Arg Arg Tyr Cys Val Arg Trp Cys Ser Asn Ala Ala Asp Glu Val Ala
210 215 220

Thr Pro Glu Asp Val Glu Pro Gly Gln Tyr Gly Arg Arg Cys Cys Asn
225 230 235 240

Trp Gly Pro Gly Arg Arg Tyr Cys Lys Arg Trp Cys His Asn Ala Ala
245 250 255

Glu Glu Ala Thr Leu Lys Ala Phe Glu Glu Glu Ala Ala Arg Glu Gln
260 265 270

Pro Val Tyr Ser Glu Asp Gln Trp Gly Arg Arg Cys Cys Gly Trp Gly
275 280 285

Pro Gly Arg Arg Tyr Cys Arg Arg Trp Cys Gln Ser Ala Glu Glu Ala
290 295 300

Ala Ala Phe Gln Ala Gly Glu Val Thr Ala Ser Leu Met Leu Ile Met
305 310 315 320

Phe Lys Ala Cys Pro Cys Met Gly Pro Val Pro Ser Val
325 330

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(vi) ORIGINAL SOURCE:
(A) ORGANISM: IB-AMP1
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Gln Trp Gly Arg Arg Cys Cys Gly Trp Gly Pro Gly Arg Arg Tyr Cys
1 5 10 15

Val Arg Trp Cys
20

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(vi) ORIGINAL SOURCE:
(A) ORGANISM: IB-AMP2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Gln Tyr Gly Arg Arg Cys Cys Asn Trp Gly Pro Gly Arg Arg Tyr Cys
1 5 10 15

Lys Arg Trp Cys
20

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(vi) ORIGINAL SOURCE:
(A) ORGANISM: IB-AMP3
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Gln Tyr Arg His Arg Cys Cys Ala Trp Gly Pro Gly Arg Lys Tyr Cys
1 5 10 15

Lys Arg Trp Cys
20

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(vi) ORIGINAL SOURCE:
(A) ORGANISM: IB-AMP4
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Gln Trp Gly Arg Arg Cys Cys Gly Trp Gly Pro Gly Arg Arg Tyr Cys
1 5 10 15

Arg Arg Trp Cys
20

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Ala Asn Ala Glu Glu Ala Ala Ala Ala Ile Pro Glu Ala Ser Glu Glu
1 5 10 15

Leu Ala Gln Glu Glu Ala Pro Val Tyr Ser Glu Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Gln Asn Ala Glu Glu Ala Ala Ala Ala Ile Pro Glu Ala Thr Glu Lys
1 5 10 15

Ala Gln Glu Ala Pro Val Tyr Ser Glu Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Gln Asn Ala Glu Glu Ala Ala Ala Ala Val Ala Ile Pro Glu Ala Ser
1 5 10 15

Glu Lys Ala Gln Glu Gly Pro Val Tyr Ser Glu Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Ser Asn Ala Ala Asp Glu Val Ala Thr Pro Glu Asp Val Glu Pro Gly
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

His Asn Ala Ala Glu Glu Ala Thr Leu Lys Ala Phe Glu Glu Glu Ala
1 5 10 15

Ala Arg Glu Gln Pro Val Tyr Ser Glu Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Lys Pro Asn Pro Thr Lys Glu Glu Glu Pro Ala Lys Lys Pro Asp Glu
1 5 10 15

Val Ser Val Lys Ser Gly Gly Pro Glu Val Ser Glu Asp
20 25

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Gln Ser Ala Glu Glu Ala Ala Ala Phe Gln Ala Gly Glu Val Thr Ala
1 5 10 15

Ser Leu Met Leu Ile Met Phe Lys Ala Cys Pro Cys Met Gly Pro Val
20 25 30

Pro Ser Val
35

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Met Val Gln Lys Gly Val Val Phe Gly Val Leu Leu Ile Leu Phe Ile
1 5 10 15

Cys Ser Thr Leu Thr Ser Ala Asp Ser
20 25

CLAIMS



1. An antimicrobial protein of about 3 kDa,
capable of being isolated from seeds of
Aralia or Impatiens.

2. An antimicrobial protein as claimed in claim 1
which is selected from the group consisting of
Arc-AMP1 and Arc-AMP2.

3. An antimicrobial protein as claimed in claim 1
which is selected from the group consisting of
Ib-AMP1, Ib-AMP2, Ib-AMP3 and Ib-AMP4.

4. An antimicrobial protein as claimed in claim 1
having an amino acid sequence which is
substantially homologous to a sequence
selected from the group consisting of SEQ ID
NO 10, SEQ ID NO 11, SEQ ID NO 12 and SEQ ID
NO 13.

5. Recombinant DNA encoding an antimicrobial
protein as claimed in any one of claims 1 to
4.

6. Recombinant DNA as claimed in claim 5 having a
nucleotide sequence which is homologous to a
run of bases contained in SEQ ID NO 8.

7. A biological system containing DNA as claimed
in claim 5.

8. A biological system as claimed in claim 7
which is a micro-organism.

9. A biological system as claimed in claim 7
which is a plant.

10. A plant having improved resistance to a fungal
or bacterial pathogen and containing
recombinant DNA which expresses an
antimicrobial protein as claimed in claim 1.

11. A process of combating fungi or bacteria
comprising exposure of the fungi or bacteria
to an antimicrobial protein as claimed in
claim 1.



12. A method for the expression of multiple
proteins in a transgenic plant comprising
inserting into the genome of the plant a gene
construct comprising a 5'-region which
includes a promoter which is capable of
initiating transcription of a structural gene
under the control thereof, at least two
protein encoding sequences and a 3'-terminator
region which includes a polyadenylation
signal, each of the said protein encoding
sequences being separated from an adjacent
protein encoding sequence by a DNA sequence
which on translation provides a cleavage site
whereby the expressed polyprotein is
post-translationally processed into the component
protein molecules, wherein at least one of the
DNA sequences providing the cleavage site
encodes an amino acid sequence selected from
the group consisting of SEQ ID NO 14, SEQ ID
NO 15, SEQ ID NO 16, SEQ ID NO 17 and SEQ ID
NO 18.

13. A method as claimed in claim 12 wherein at
least one of the DNA sequences providing the
cleavage site is derived from SEQ ID NO 8.

14. A gene construct for use in the genetic
modification of plants comprising in sequence,
a gene promoter active in plant cells, a
plurality of protein encoding regions and a
3'-non-translated region containing a
polyadenylation signal, wherein each of the
plurality of protein encoding regions is
separated by a DNA sequence which on
translation provides a cleavage site whereby
the expressed polyprotein is
post-translationally processed into the component
protein molecules and at least one of the DNA
sequences providing the cleavage site encodes
an amino acid sequence selected from the group
consisting of SEQ ID NO 14, SEQ ID NO 15, SEQ
ID NO 16, SEQ ID NO 17 and SEQ ID NO 18.

15. A gene construct as claimed in claim 14, in
which at least one of the DNA sequences
providing the cleavage site is derived from
SEQ ID NO 8.

FIG. 1 to FIG. 5 and FIG. 8 (drawing sheets; images not reproduced)